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This  final  report  of  grant  AFOSR-80-0190  covers  the  second 


year  of  the  grant  and  is  divided  into  five  sections.  Section 
one  gives  a  brief  summary  of  the  motivation  of  the  effort.  Sec¬ 
tion  two  reports  our  progress  this  year.  Section  three  summar¬ 
izes  the  report.  The  references  are  collected  in  section  four; 
a  list  of  publications  and  presentations  during  the  first 
year's  effort  appears  in  section  five;  one  for  this  year's 
effort  appears  in  section  six.  Related  work  is  cited  throughout 


section  two;  however,  a  complete  discussion  of  related  work 


appears  in  Weischedel  (1980)  and  in  Weischedel  and  Chester 


(1983) . 


1.  Motivation  of  the  Work 

The  technique  of  formal  module  specifications  seems  to 
offer  much  toward  alleviating  many  problems  of  large  software 
systems  (those  systems  requiring  at  least  25  progreunmers  for 
development  and  at  least  30,000  lines  of  source  code).  The  high 
cost  of  software  maintenance,  the  predominance  of  design  errors, 
the  difficulty  in  modifying  software,  and  the  difficulty  and 
cost  of  diagnosing  and  correcting  design  errors  are  some  of  the 
problems  addressed  by  formal  specifications  based  on  the 
information-hiding  principle.  Yet,  the  creating  of  formal 
specifications  is  very  difficult,  requiring  much  upfront  effort. 
For  instance,  Parnas  (1976,  p.  7)  states,  "Experience  has  shown 
that  the  effort  Involved  in  writing  the  set  of  specifications 
can  be  greater  than  the  effort  it  would  take  to  write  one  com¬ 
plete  program."  It  is  also  generally  agreed  that  formal  specifi¬ 
cations  are  difficult  to  understand. 
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Our  work,  which  has  been  funded  by  AFOSR  under  contract 
number  P49620-79A)131  and  grant  number  AFOSR-80--0190,  has  had 
both  long-term  and  short-term  goals.  The  long-term  objectives 
of  the  final  year  of  this  effort  have  been  a  preliminary,  fun¬ 
damental  study  of  the  problems  involved  in  understanding 
software  system  requirements  written  in  English  and  transforming 
them  into  formal  specifications  of  a  software  module.  The 
research  has  culminated  in  a  prototype  system.  A  short-term 
objective  of  this  final  year  of  the  effort  has  been  to  seek  a 
more  understandable  alternative  to  first-order  quantification  in 
formal  specification  languages. 

The  motivation  of  the  work  is  covered  at  greater  length  in 
the  proposal  for  the  grant  and  the  contract  mentioned  above. 

2.  Results 

The  proposal  for  this  present  year's  effort  included  three 
areas;  one  in  short-term-goals  and  two  in  long-term  goals. 

2.1  Short-term  Goals 

The  formal  specification  of  a  module  must  be  understandable 
if  it  is  to  achieve  its  purpose,  for  it  acts  as  a  contract 
between  designers  and  programming  team,  stating  exactly  what  the 
programming  team's  product  must  do  (Parnas,  1977).  Unless  they 
are  understandable,  1)  programmers  will  not  know  what  the  modu¬ 
le  they  are  to  implement  is  to  do  nor  how  to  use  other  modules, 
and  2)  designers  will  not  be  able  to  detect  design  errors  nor 
easily  confirm  that  their  design  satisfies  user  requirements. 
Also,  if  one  is  to  use  a  reference  library  of  formal  specifica¬ 
tions,  they  must  be  understandable,  for  if  the  designer  cannot 


understand  the  alternative  specifications,  how  can  an  intelli¬ 
gent  choice  be  made  among  the  alternatives?  If  formal  specifica¬ 
tions  of  module  interfaces  are  to  become  widely  used,  they  must 
be  understandable. 

The  proposal  for  this  year's  work  identified  quantification 
in  formal  logic  both  as  a  source  of  difficulty  in  understanding 
formal  specifications  and  as  a  mechanism  for  which  an  alterna¬ 
tive  could  be  found.  Though  we  originally  anticipated  that  a 
new  specification  language  would  be  needed,  we  found  an  interes¬ 
ting  alternative,  namely,  Horn  clauses,  which  had  never  before 
been  applied  for  software  specification,  though  they  had  been 
used  in  logic  programming.  The  definition  of  Horn  clauses  and  a 
discussion  of  their  applicability  is  presented  in  Weischedel  and 
Chester  (1983);  a  review  of  the  use  of  Horn  clauses  is  given  in 
Chester  (1982).  We  do  not  believe  them  to  be  the  ultimate 
specification  language;  nevertheless  they  have  many  advantages, 
including  a  highly  understandable  treatment  of  quantification. 

2.2  Long-term  Goals 

The  two  sections  that  follow  deal  with  the  long-term  goals. 
In  particular,  Weischedel  (1980)  identifies  the  form  a  tool 
might  have,  which,  under  significant  user  direction,  would 
derive  formal  specifications  from  English  requirements.  A 
second  long-term  goal  is  anaphora  usage,  a  well-known  unsolved 
problem  in  natural  language  processing.  We  have  developed  a 
small  prototype  of  such  a  system. 

The  structure  we  have  chosen  for  the  prototype  tool  is 
given  in  Figure  1.  All  arrows  represent  data  flow. 


SYSTEM  STRUCTURE 
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Given  an  English  input,  the  RUS  parser  (Bobrow,  1978) 
recognizes  the  syntactic  structure  of  it.  Using  a  lexicon  (a 
highly  formalized  dictionary),  the  parser  generates  a  semantic 
representation  for  the  input.  Since  the  syntax  of  English  is 
highly  ambiguous,  the  parser  interacts  with  the  knowledge  base 
in  the  "conceptual  view"  as  a  means  of  using  factual  knowledge 
to  reduce  the  number  of  semantically  senseless  parses  tried  by 
the  parser.  The  parser  and  dictionary  constitute  the  syntactic 
component  of  the  system. 

The  semantic  component  applies  simple  semantic  constraints 
to  filter  out  anomalous  interpretations  during  parsing.  For 
example,  in  this  environment,  a  process  or  a  person  may  delete  a 
data  entity  from  a  data  strucutre;  any  parse  of  a  clause  invol¬ 
ving  deletion  must  conform  to  that  sense.  The  operation  of  the 
parser  and  semantic  component,  as  well  as  our  experience  with 
them  are  reported  in  Weischedel  and  Chester  (1983). 

Much  of  our  effort  in  semantics  has  been  designing  the  map¬ 
ping  component  and  the  input  and  output  representations  for  it. 
The  input  to  the  mapping  component  is  a  set  of  Horn  clauses 
whose  predicates,  functions,  and  constants  are  those  of  the 
user.  The  motivation  for  having  two  levels  of  representation  is 
detailed  in  Weischedel  and  Chester  (1983).  'The  design  of  the 
mapping  component  is  one  of  the  most  important  contributions  of 
our  research,  since  few  have  considered  these  issues  before. 
Our  work  has  provided  a  basis  for  solutions  to  problems  in  in¬ 
terpreting  spatial  metaphor,  determining  the  meaning  of  known 
words  in  new  environments,  and  recognizing  semantic  paraphrases. 
The  results  are  reported  in  Weischedel  and  Chester  (1983). 


Though  we  had  originally  planned  a  small  effort  in  genera¬ 
ting  clarifying  questions,  we  decided  the  effort  would  be  better 
spent  on  the  more  unique  aspects  of  our  research.  Several  lar¬ 
ger  research  efforts  (Mann,  et  al .  1981)  are  underway  in  natural 
language  generation,  including  one  (Swartout,  1982)  in  genera¬ 
ting  English  paraphrases  of  formal  specification. 

2 . 2  Anaphora 

In  addicion  to  the  semantic  analysis  discussed  above,  a 
separate  problem  in  semantics  was  studied.  Determining  what  a 
pronoun  or  noun  phrase  refers  to  has  been  a  topic  of-  much  in¬ 
terest  recently  in  artificial  intelligence;  it  is  called  defini¬ 
te  anaphora  resultion.  Sidner  (1979)  presents  a  heuristic  for 
detecting  clues  in  dialogue  for/  the  focus  of  attention,  and 
explains  how  that  can  be  used  for  determining  reference.  The 
consultant  on  this  grant  has  been  modifying  this  heuristic  for 
texts  of  English  specifications  gather  than  dialogue;  see  Joshi 
and  Weinstein  (1982)  and  Grosz,  et  al .  (1983).  In  doing  this,  he 
has  found  that  Sidner 's  not ion. of  focus  is  inadequate.  Rather, 
two  objects  seem  to  serve  as  centers  of  attention.  A  backward 
center  serves  as  a  focus  in  a  single  sentence  and  corresponds  to 
previous  objects  in  the  text.  A  forward  center  serves  as  a 
focus  for  later  references  and  introduces  new  objects.  In  the 
sentence  "Processes  must  execute  in  a  single  fork",  "processes" 
serves  as  the  backward  center,  and  "a  single  fork"  is  the 
forward  center. 


3.  Conclusions 

The  effort  for  this  year  involved  three  areas  of  work.  In 


•the  first  ares  we  investigated  Horn  clauses  as  a  more  understan¬ 
dable  alternative  to  logical  quantifiers.  The  second  area  was 
developing  an  experimental  software  tool  to  test  our  solutions 
to  problems  in  transforming  English  descriptions  to  formal 
specifications  under  significant  user  assistance.  A  third  prob¬ 
lem  is  anaphora  resolution;  the  heuristics  of  Sidner  (1979)  have 
been  extended . 


This  application  of  natural  language  processing  has  the 
following  open  problems: 

1.  formal  specification  languages  with  richer  semantics  so  that 
the  level  of  the  target  language  is  closer  to  that  of  natural 
languages, 

2.  development  of  more  flexible,  forgiving  natural  language 
interfaces  (Weischedel  1983)  that  have  partial  understanding 
even  of  poorly  formed  input, 

3.  extension  of  the  technology  to  broad  areas  of  specification, 

4.  continued  development  of  high  quality  English  generation 
components  both  for  creating  a  paraphrase  of  the  formal  specifi¬ 
cation  and  for  generating  questions  to  clarify  ambiguous  or 
vague  aspects  of  the  English  definitions,  and 

5.  further  development  of  the  mapping  phase. 
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